Overview
Dataset statistics
| Number of variables | 29 |
|---|---|
| Number of observations | 2139048 |
| Missing cells | 18314504 |
| Missing cells (%) | 29.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 473.3 MiB |
| Average record size in memory | 232.0 B |
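The two derived figures in the table above can be re-computed from the raw counts. This is a minimal sketch. It assumes the missing-cell percentage is missing cells divided by rows times columns, and the average record size is total memory divided by the number of observations. Both definitions are inferred from the numbers; the report does not state them.

```python
# Values copied from the "Dataset statistics" table.
N_VARIABLES = 29
N_OBSERVATIONS = 2_139_048
MISSING_CELLS = 18_314_504
TOTAL_MEMORY_MIB = 473.3


def missing_cell_pct(missing: int, rows: int, cols: int) -> float:
    """Share of empty cells over the full rows x cols grid, in percent."""
    return 100.0 * missing / (rows * cols)


def avg_record_bytes(total_mib: float, rows: int) -> float:
    """Average in-memory size of one row, in bytes (1 MiB = 1024**2 bytes)."""
    return total_mib * 1024**2 / rows


print(f"{missing_cell_pct(MISSING_CELLS, N_OBSERVATIONS, N_VARIABLES):.1f}%")
print(f"{avg_record_bytes(TOTAL_MEMORY_MIB, N_OBSERVATIONS):.1f} B")
```

Both results round to the figures reported in the table (29.5% and 232.0 B).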
Variable types
| DateTime | 2 |
|---|---|
| Categorical | 6 |
| Unsupported | 1 |
| Numeric | 8 |
| Text | 12 |
CONTRIBUTING FACTOR VEHICLE 4 is highly overall correlated with CONTRIBUTING FACTOR VEHICLE 5 | High correlation |
CONTRIBUTING FACTOR VEHICLE 5 is highly overall correlated with CONTRIBUTING FACTOR VEHICLE 4 | High correlation |
NUMBER OF CYCLIST KILLED is highly overall correlated with NUMBER OF PEDESTRIANS KILLED and 1 other fields | High correlation |
NUMBER OF MOTORIST INJURED is highly overall correlated with NUMBER OF PERSONS INJURED | High correlation |
NUMBER OF MOTORIST KILLED is highly overall correlated with NUMBER OF PERSONS KILLED | High correlation |
NUMBER OF PEDESTRIANS KILLED is highly overall correlated with NUMBER OF CYCLIST KILLED and 1 other fields | High correlation |
NUMBER OF PERSONS INJURED is highly overall correlated with NUMBER OF MOTORIST INJURED | High correlation |
NUMBER OF PERSONS KILLED is highly overall correlated with NUMBER OF CYCLIST KILLED and 2 other fields | High correlation |
NUMBER OF PEDESTRIANS KILLED is highly imbalanced (99.6%) | Imbalance |
NUMBER OF CYCLIST INJURED is highly imbalanced (92.0%) | Imbalance |
NUMBER OF CYCLIST KILLED is highly imbalanced (99.9%) | Imbalance |
CONTRIBUTING FACTOR VEHICLE 4 is highly imbalanced (90.9%) | Imbalance |
CONTRIBUTING FACTOR VEHICLE 5 is highly imbalanced (90.1%) | Imbalance |
BOROUGH has 664048 (31.0%) missing values | Missing |
ZIP CODE has 664310 (31.1%) missing values | Missing |
LATITUDE has 239440 (11.2%) missing values | Missing |
LONGITUDE has 239440 (11.2%) missing values | Missing |
LOCATION has 239440 (11.2%) missing values | Missing |
ON STREET NAME has 458746 (21.4%) missing values | Missing |
CROSS STREET NAME has 815476 (38.1%) missing values | Missing |
OFF STREET NAME has 1772675 (82.9%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 2 has 336447 (15.7%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 3 has 1985155 (92.8%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 4 has 2104055 (98.4%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 5 has 2129500 (99.6%) missing values | Missing |
VEHICLE TYPE CODE 2 has 417714 (19.5%) missing values | Missing |
VEHICLE TYPE CODE 3 has 1990930 (93.1%) missing values | Missing |
VEHICLE TYPE CODE 4 has 2105307 (98.4%) missing values | Missing |
VEHICLE TYPE CODE 5 has 2129794 (99.6%) missing values | Missing |
LATITUDE is highly skewed (γ1 = -20.03202737) | Skewed |
NUMBER OF PERSONS KILLED is highly skewed (γ1 = 33.18090186) | Skewed |
NUMBER OF MOTORIST KILLED is highly skewed (γ1 = 53.52641947) | Skewed |
COLLISION_ID has unique values | Unique |
ZIP CODE is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
NUMBER OF PERSONS INJURED has 1636505 (76.5%) zeros | Zeros |
NUMBER OF PERSONS KILLED has 2135856 (99.9%) zeros | Zeros |
NUMBER OF PEDESTRIANS INJURED has 2020475 (94.5%) zeros | Zeros |
NUMBER OF MOTORIST INJURED has 1819309 (85.1%) zeros | Zeros |
NUMBER OF MOTORIST KILLED has 2137793 (99.9%) zeros | Zeros |
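The "Imbalance" percentages in the alerts above are consistent with a normalised-entropy score: one minus the Shannon entropy of the class frequencies divided by the logarithm of the number of classes. The sketch below applies that formula to class counts copied from the "Common Values" tables of this report. The function name and the single-class convention are choices made for illustration, and the formula is inferred from the numbers rather than quoted from the tool.

```python
import math
from typing import Sequence


def imbalance_score(counts: Sequence[int]) -> float:
    """Return 1 - H / log(k): 0 for a uniform split, near 1 when one class dominates."""
    k = len(counts)
    if k <= 1:
        return 0.0  # convention chosen here: a single class has no imbalance to measure
    total = sum(counts)
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)


# Class counts for values 0, 1, 2, ... as listed in the report.
cyclist_killed = [2_138_791, 256, 1]
cyclist_injured = [2_080_100, 58_250, 673, 24, 1]

print(f"NUMBER OF CYCLIST KILLED:  {imbalance_score(cyclist_killed):.1%}")
print(f"NUMBER OF CYCLIST INJURED: {imbalance_score(cyclist_injured):.1%}")
```

A score this close to 100% means the column is almost constant, so it carries little signal for most models unless the rare classes are the target of interest.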
Reproduction
| Analysis started | 2024-12-04 17:58:22.154853 |
|---|---|
| Analysis finished | 2024-12-04 18:00:52.508173 |
| Duration | 2 minutes and 30.35 seconds |
| Software version | ydata-profiling v4.12.0 |
| Download configuration | config.json |
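A report like this one can be regenerated with a few lines of Python. The sketch below is a hedged outline, not the configuration actually used: the file names and report title are hypothetical, and the real settings live in the config.json referenced above. `ProfileReport` and `to_file` are part of the ydata-profiling package.

```python
from pathlib import Path


def build_profile(csv_path: str, html_path: str, minimal: bool = False) -> Path:
    """Profile a CSV file and write the HTML report; returns the output path."""
    # Imported lazily so that defining this function does not require the
    # heavy dependencies to be installed.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # low_memory=False makes pandas infer each column's dtype from the whole file.
    df = pd.read_csv(csv_path, low_memory=False)
    report = ProfileReport(df, title="Collisions profile", minimal=minimal)
    report.to_file(html_path)
    return Path(html_path)


# Example call (hypothetical file names):
# build_profile("collisions.csv", "collisions_profile.html")
```

Passing `minimal=True` skips the more expensive computations such as correlations, which may be worth it on a table of this size given the 2.5 minute run time reported above.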
Variables
CRASH DATE
Date
| Distinct | 4536 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.3 MiB |
| Minimum | 2012-07-01 00:00:00 |
|---|---|
| Maximum | 2024-11-30 00:00:00 |
CRASH TIME
Date
| Distinct | 1440 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.3 MiB |
| Minimum | 2024-12-04 00:00:00 |
|---|---|
| Maximum | 2024-12-04 23:59:00 |
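The CRASH TIME range above carries the date 2024-12-04, which is the day the analysis ran, and the column has 1440 distinct values, one per minute of a day. That suggests the column holds time-of-day strings that were parsed as datetimes with the current date filled in. One way to avoid the misleading date is to merge CRASH DATE and CRASH TIME into a single timestamp before profiling. The sketch below assumes `MM/DD/YYYY` dates and `H:MM` times; both formats are assumptions, since the report does not show the raw strings.

```python
from datetime import datetime

# Assumed raw formats; adjust them to match the actual file.
DATE_FORMAT = "%m/%d/%Y"
TIME_FORMAT = "%H:%M"


def crash_timestamp(crash_date: str, crash_time: str) -> datetime:
    """Merge the date-only and time-only fields into one datetime."""
    day = datetime.strptime(crash_date.strip(), DATE_FORMAT).date()
    clock = datetime.strptime(crash_time.strip(), TIME_FORMAT).time()
    return datetime.combine(day, clock)


print(crash_timestamp("11/30/2024", "23:59"))
```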
BOROUGH
Categorical
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 664048 |
| Missing (%) | 31.0% |
| Memory size | 16.3 MiB |
| BROOKLYN | |
|---|---|
| QUEENS | |
| MANHATTAN | |
| BRONX | |
| STATEN ISLAND |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 7.4519586 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BROOKLYN |
|---|---|
| 2nd row | BROOKLYN |
| 3rd row | BRONX |
| 4th row | BROOKLYN |
| 5th row | MANHATTAN |
Common Values
| Value | Count | Frequency (%) |
| BROOKLYN | 470551 | 22.0% |
| QUEENS | 395650 | 18.5% |
| MANHATTAN | 328674 | 15.4% |
| BRONX | 218295 | 10.2% |
| STATEN ISLAND | 61830 | 2.9% |
| (Missing) | 664048 | 31.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| brooklyn | 470551 | |
| queens | 395650 | |
| manhattan | 328674 | |
| bronx | 218295 | |
| staten | 61830 | 4.0% |
| island | 61830 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1865504 | |
| O | 1159397 | |
| A | 1109682 | |
| E | 853130 | 7.8% |
| T | 781008 | 7.1% |
| R | 688846 | 6.3% |
| B | 688846 | 6.3% |
| L | 532381 | 4.8% |
| S | 519310 | 4.7% |
| Y | 470551 | 4.3% |
| Other values (9) | 2322984 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10991639 |
ZIP CODE
Unsupported
Missing  Rejected  Unsupported 
| Missing | 664310 |
|---|---|
| Missing (%) | 31.1% |
| Memory size | 16.3 MiB |
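An "Unsupported" type often means the column mixes several Python types, for example floats, integers, and padded strings. That is only one possible cause, and the report does not confirm it for ZIP CODE. A minimal cleaning sketch under that assumption maps every value to a five-digit string or to `None`; the function name is hypothetical.

```python
import math
import re
from typing import Optional, Union


def normalize_zip(value: Union[str, int, float, None]) -> Optional[str]:
    """Return a 5-digit ZIP code string, or None when the value is unusable."""
    if value is None:
        return None
    if isinstance(value, float):
        if math.isnan(value):
            return None
        value = int(value)  # 11208.0 -> 11208
    text = str(value).strip()
    # Keep only clean five-digit codes; everything else is treated as missing.
    return text if re.fullmatch(r"\d{5}", text) else None


print([normalize_zip(v) for v in (11208.0, " 10001 ", "", float("nan"), "N/A")])
```

After such a pass the column has a single type, so a profiler can treat it as categorical or text instead of rejecting it.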
LATITUDE
Real number (ℝ)
Missing  Skewed 
| Distinct | 127648 |
|---|---|
| Distinct (%) | 6.7% |
| Missing | 239440 |
| Missing (%) | 11.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.623741 |
| Minimum | 0 |
|---|---|
| Maximum | 43.344444 |
| Zeros | 4677 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 40.596511 |
| Q1 | 40.66758 |
| median | 40.720567 |
| Q3 | 40.769623 |
| 95-th percentile | 40.86194 |
| Maximum | 43.344444 |
| Range | 43.344444 |
| Interquartile range (IQR) | 0.102043 |
Descriptive statistics
| Standard deviation | 2.0197808 |
|---|---|
| Coefficient of variation (CV) | 0.049719222 |
| Kurtosis | 399.91004 |
| Mean | 40.623741 |
| Median Absolute Deviation (MAD) | 0.0513168 |
| Skewness | -20.032027 |
| Sum | 77169184 |
| Variance | 4.0795145 |
| Monotonicity | Not monotonic |
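Several of the LATITUDE figures above are tied together by simple identities: variance is the squared standard deviation, the coefficient of variation is the standard deviation over the mean, the IQR is Q3 minus Q1, and the range is maximum minus minimum. The short check below confirms that the reported values satisfy them, within rounding.

```python
import math

# Figures copied from the LATITUDE tables above.
mean, std, variance = 40.623741, 2.0197808, 4.0795145
q1, q3, iqr = 40.66758, 40.769623, 0.102043
minimum, maximum, value_range = 0.0, 43.344444, 43.344444
cv = 0.049719222

checks = {
    "variance = std^2": math.isclose(std**2, variance, rel_tol=1e-6),
    "CV = std / mean": math.isclose(std / mean, cv, rel_tol=1e-6),
    "IQR = Q3 - Q1": math.isclose(q3 - q1, iqr, abs_tol=1e-6),
    "range = max - min": math.isclose(maximum - minimum, value_range, abs_tol=1e-6),
}
for name, ok in checks.items():
    print(f"{name}: {ok}")
```

Note how the standard deviation (about 2.02) dwarfs the IQR (about 0.10): the spread is driven by the 4677 zero values and a few distant outliers rather than by the bulk of the data.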
| Value | Count | Frequency (%) |
| 0 | 4677 | 0.2% |
| 40.861862 | 918 | < 0.1% |
| 40.696033 | 793 | < 0.1% |
| 40.8047 | 693 | < 0.1% |
| 40.608757 | 681 | < 0.1% |
| 40.798256 | 635 | < 0.1% |
| 40.759308 | 633 | < 0.1% |
| 40.6960346 | 587 | < 0.1% |
| 40.675735 | 585 | < 0.1% |
| 40.658577 | 544 | < 0.1% |
| Other values (127638) | 1888862 | |
| (Missing) | 239440 | 11.2% |
| Value | Count | Frequency (%) |
| 0 | 4677 | |
| 30.78418 | 1 | < 0.1% |
| 34.783634 | 1 | < 0.1% |
| 40.498947 | 1 | < 0.1% |
| 40.4989488 | 2 | < 0.1% |
| 40.4991346 | 1 | < 0.1% |
| 40.49931 | 1 | < 0.1% |
| 40.4994787 | 1 | < 0.1% |
| 40.499659 | 1 | < 0.1% |
| 40.499672 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 43.344444 | 1 | < 0.1% |
| 42.64154 | 1 | < 0.1% |
| 42.318317 | 1 | < 0.1% |
| 42.107204 | 1 | < 0.1% |
| 41.91661 | 1 | < 0.1% |
| 41.34796 | 1 | < 0.1% |
| 41.258785 | 1 | < 0.1% |
| 41.12615 | 5 | |
| 41.12421 | 1 | < 0.1% |
| 41.061634 | 2 | < 0.1% |
LONGITUDE
Real number (ℝ)
Missing 
| Distinct | 99084 |
|---|---|
| Distinct (%) | 5.2% |
| Missing | 239440 |
| Missing (%) | 11.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -73.7449 |
| Minimum | -201.35999 |
|---|---|
| Maximum | 0 |
| Zeros | 4677 |
| Zeros (%) | 0.2% |
| Negative | 1894931 |
| Negative (%) | 88.6% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | -201.35999 |
|---|---|
| 5-th percentile | -74.037093 |
| Q1 | -73.9747 |
| median | -73.92709 |
| Q3 | -73.866761 |
| 95-th percentile | -73.76318 |
| Maximum | 0 |
| Range | 201.35999 |
| Interquartile range (IQR) | 0.1079388 |
Descriptive statistics
| Standard deviation | 3.7881604 |
|---|---|
| Coefficient of variation (CV) | -0.051368439 |
| Kurtosis | 422.31976 |
| Mean | -73.7449 |
| Median Absolute Deviation (MAD) | 0.0525804 |
| Skewness | 16.047859 |
| Sum | -1.400864 × 10^8 |
| Variance | 14.350159 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4677 | 0.2% |
| -73.89063 | 790 | < 0.1% |
| -73.91282 | 719 | < 0.1% |
| -73.98453 | 708 | < 0.1% |
| -73.89686 | 684 | < 0.1% |
| -74.038086 | 682 | < 0.1% |
| -73.91243 | 656 | < 0.1% |
| -73.94476 | 618 | < 0.1% |
| -73.9112 | 592 | < 0.1% |
| -73.9845292 | 587 | < 0.1% |
| Other values (99074) | 1888895 | |
| (Missing) | 239440 | 11.2% |
| Value | Count | Frequency (%) |
| -201.35999 | 1 | < 0.1% |
| -201.23706 | 105 | |
| -89.13527 | 1 | < 0.1% |
| -86.76847 | 1 | < 0.1% |
| -79.61955 | 1 | < 0.1% |
| -79.00183 | 1 | < 0.1% |
| -76.2634 | 1 | < 0.1% |
| -76.02163 | 1 | < 0.1% |
| -74.742 | 7 | < 0.1% |
| -74.25496 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 4677 | |
| -32.768513 | 16 | < 0.1% |
| -47.209625 | 3 | < 0.1% |
| -73.66301 | 1 | < 0.1% |
| -73.70055 | 2 | < 0.1% |
| -73.700584 | 11 | < 0.1% |
| -73.7005968 | 10 | < 0.1% |
| -73.70061 | 5 | < 0.1% |
| -73.70071 | 4 | < 0.1% |
| -73.70073 | 1 | < 0.1% |
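The extreme values above show two kinds of bad coordinates: 4677 rows where latitude and longitude are exactly 0, and a handful of points far from the bulk of the data, including longitudes near -201, which is not a valid longitude. A simple filter is a bounding-box test. The box below is a rough guess for New York City chosen for illustration, not a value taken from the report, and the sample pairs are made up from numbers that appear in the tables.

```python
from typing import Optional

# Rough bounding box for New York City; an assumption, not taken from the report.
LAT_MIN, LAT_MAX = 40.4, 41.0
LON_MIN, LON_MAX = -74.3, -73.6


def is_plausible_point(lat: Optional[float], lon: Optional[float]) -> bool:
    """True when both coordinates are present and fall inside the bounding box."""
    if lat is None or lon is None:
        return False
    # NaN compares False against everything, so NaN coordinates are rejected too.
    return LAT_MIN <= lat <= LAT_MAX and LON_MIN <= lon <= LON_MAX


samples = [(40.667202, -73.8665), (0.0, 0.0), (40.7, -201.23706), (43.344444, -73.9)]
print([is_plausible_point(lat, lon) for lat, lon in samples])
```

Rows that fail the test are probably better treated as missing than dropped, since the other fields of the record may still be valid.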
LOCATION
Text
Missing 
| Distinct | 297878 |
|---|---|
| Distinct (%) | 15.7% |
| Missing | 239440 |
| Missing (%) | 11.2% |
| Memory size | 16.3 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 24 |
| Mean length | 22.746237 |
| Min length | 10 |
Unique
| Unique | 166390 |
|---|---|
| Unique (%) | 8.8% |
Sample
| 1st row | (40.667202, -73.8665) |
|---|---|
| 2nd row | (40.683304, -73.917274) |
| 3rd row | (40.709183, -73.956825) |
| 4th row | (40.86816, -73.83148) |
| 5th row | (40.67172, -73.8971) |
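LOCATION is profiled as text, but the sample rows show it is a `(latitude, longitude)` pair in string form. A small parser, sketched below with a hypothetical function name, turns it back into two floats so that it can be compared against the LATITUDE and LONGITUDE columns.

```python
import re
from typing import Optional, Tuple

# One signed decimal number, a comma, another signed decimal number, in parentheses.
_POINT = re.compile(r"^\(\s*(-?\d+(?:\.\d+)?)\s*,\s*(-?\d+(?:\.\d+)?)\s*\)$")


def parse_location(text: str) -> Optional[Tuple[float, float]]:
    """Parse '(lat, lon)' into a (lat, lon) tuple of floats; None if malformed."""
    match = _POINT.match(text.strip())
    if match is None:
        return None
    return float(match.group(1)), float(match.group(2))


print(parse_location("(40.667202, -73.8665)"))
print(parse_location("(0.0, 0.0)"))
```

The second call is the shortest possible form at 10 characters, which matches the "Min length" reported above and the zero-coordinate rows seen in LATITUDE and LONGITUDE.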
| Value | Count | Frequency (%) |
| 0.0 | 9354 | 0.2% |
| 40.861862 | 918 | < 0.1% |
| 40.696033 | 793 | < 0.1% |
| 73.89063 | 790 | < 0.1% |
| 73.91282 | 719 | < 0.1% |
| 73.98453 | 708 | < 0.1% |
| 40.8047 | 693 | < 0.1% |
| 73.89686 | 684 | < 0.1% |
| 74.038086 | 682 | < 0.1% |
| 40.608757 | 681 | < 0.1% |
| Other values (226721) | 3783194 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 4730794 | |
| 4 | 4101049 | 9.5% |
| . | 3799216 | 8.8% |
| 3 | 3601441 | 8.3% |
| 0 | 3502505 | 8.1% |
| 9 | 2775395 | 6.4% |
| 8 | 2725726 | 6.3% |
| 6 | 2694721 | 6.2% |
| 5 | 2156181 | 5.0% |
| ) | 1899608 | 4.4% |
| Other values (6) | 11222297 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 43208933 |
ON STREET NAME
Text
Missing 
| Distinct | 20297 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 458746 |
| Missing (%) | 21.4% |
| Memory size | 16.3 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 29.208227 |
| Min length | 2 |
Unique
| Unique | 7380 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | WHITESTONE EXPRESSWAY |
|---|---|
| 2nd row | QUEENSBORO BRIDGE UPPER |
| 3rd row | THROGS NECK BRIDGE |
| 4th row | SARATOGA AVENUE |
| 5th row | MAJOR DEEGAN EXPRESSWAY RAMP |
| Value | Count | Frequency (%) |
| avenue | 622140 | 16.0% |
| street | 532426 | 13.7% |
| east | 156743 | 4.0% |
| boulevard | 129739 | 3.3% |
| west | 117202 | 3.0% |
| parkway | 77408 | 2.0% |
| road | 69620 | 1.8% |
| expressway | 66041 | 1.7% |
| island | 31656 | 0.8% |
| queens | 28007 | 0.7% |
| Other values (5424) | 2046868 |
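The table above counts words rather than whole street names. A comparable tally can be sketched by lower-casing each value and splitting on whitespace. Whether the profiler tokenises in exactly this way is an assumption; the input below is the five sample rows shown earlier for this column.

```python
from collections import Counter
from typing import Iterable


def word_counts(values: Iterable[str]) -> Counter:
    """Count lower-cased, whitespace-separated words across all values."""
    counts: Counter = Counter()
    for value in values:
        # str.split() with no argument also discards leading and trailing padding.
        counts.update(value.lower().split())
    return counts


sample_rows = [
    "WHITESTONE EXPRESSWAY",
    "QUEENSBORO BRIDGE UPPER",
    "THROGS NECK BRIDGE",
    "SARATOGA AVENUE",
    "MAJOR DEEGAN EXPRESSWAY RAMP",
]
for word, count in word_counts(sample_rows).most_common(3):
    print(word, count)
```

Splitting on whitespace matters here because the median length equals the maximum length of 32, which suggests the raw street names are padded with spaces to a fixed width.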
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 27622941 | |
| E | 3767585 | 7.7% |
| A | 2005980 | 4.1% |
| T | 1877599 | 3.8% |
| R | 1716542 | 3.5% |
| N | 1466857 | 3.0% |
| S | 1447403 | 2.9% |
| U | 1001507 | 2.0% |
| O | 893179 | 1.8% |
| V | 875073 | 1.8% |
| Other values (65) | 6403977 | 13.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 49078643 |
CROSS STREET NAME
Text
Missing 
| Distinct | 22031 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 815476 |
| Missing (%) | 38.1% |
| Memory size | 16.3 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 31 |
| Mean length | 22.458799 |
| Min length | 1 |
Unique
| Unique | 7001 |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | 20 AVENUE |
|---|---|
| 2nd row | DECATUR STREET |
| 3rd row | EAST 43 STREET |
| 4th row | EAST GATE PLAZA |
| 5th row | west 80 street -west 81 street |
| Value | Count | Frequency (%) |
| avenue | 577527 | 19.7% |
| street | 468587 | 16.0% |
| east | 114391 | 3.9% |
| west | 72235 | 2.5% |
| boulevard | 70367 | 2.4% |
| road | 56758 | 1.9% |
| place | 34621 | 1.2% |
| parkway | 27321 | 0.9% |
| 3 | 19241 | 0.7% |
| park | 17806 | 0.6% |
| Other values (5526) | 1468653 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 14155131 | |
| E | 3004574 | 10.1% |
| T | 1486246 | 5.0% |
| A | 1455639 | 4.9% |
| R | 1174676 | 4.0% |
| N | 1101186 | 3.7% |
| S | 1012508 | 3.4% |
| U | 795008 | 2.7% |
| V | 727329 | 2.4% |
| O | 593812 | 2.0% |
| Other values (66) | 4219728 | 14.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 29725837 |
OFF STREET NAME
Text
Missing 
| Distinct | 238084 |
|---|---|
| Distinct (%) | 65.0% |
| Missing | 1772675 |
| Missing (%) | 82.9% |
| Memory size | 16.3 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 40 |
| Mean length | 35.365745 |
| Min length | 8 |
Unique
| Unique | 185679 |
|---|---|
| Unique (%) | 50.7% |
Sample
| 1st row | 1211 LORING AVENUE |
|---|---|
| 2nd row | 344 BAYCHESTER AVENUE |
| 3rd row | 2047 PITKIN AVENUE |
| 4th row | 480 DEAN STREET |
| 5th row | 878 FLATBUSH AVENUE |
| Value | Count | Frequency (%) |
| avenue | 144341 | 11.9% |
| street | 132139 | 10.9% |
| east | 34831 | 2.9% |
| west | 25197 | 2.1% |
| boulevard | 22997 | 1.9% |
| road | 17136 | 1.4% |
| lot | 7881 | 0.6% |
| parking | 7267 | 0.6% |
| parkway | 7265 | 0.6% |
| place | 7123 | 0.6% |
| Other values (27876) | 811117 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 7002858 | |
| E | 836211 | 6.5% |
| T | 458410 | 3.5% |
| A | 428269 | 3.3% |
| R | 355701 | 2.7% |
| N | 312373 | 2.4% |
| S | 300946 | 2.3% |
| 1 | 291801 | 2.3% |
| U | 212281 | 1.6% |
| V | 198958 | 1.5% |
| Other values (74) | 2559246 | 19.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12957054 |
NUMBER OF PERSONS INJURED
Real number (ℝ)
High correlation  Zeros 
| Distinct | 32 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 18 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.31853971 |
| Minimum | 0 |
|---|---|
| Maximum | 43 |
| Zeros | 1636505 |
| Zeros (%) | 76.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.70733463 |
|---|---|
| Coefficient of variation (CV) | 2.220554 |
| Kurtosis | 48.770161 |
| Mean | 0.31853971 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.1681048 |
| Sum | 681366 |
| Variance | 0.50032228 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1636505 | 76.5% |
| 1 | 389983 | 18.2% |
| 2 | 73462 | 3.4% |
| 3 | 24056 | 1.1% |
| 4 | 8913 | 0.4% |
| 5 | 3413 | 0.2% |
| 6 | 1430 | 0.1% |
| 7 | 599 | < 0.1% |
| 8 | 267 | < 0.1% |
| 9 | 135 | < 0.1% |
| Other values (22) | 267 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1636505 | |
| 1 | 389983 | 18.2% |
| 2 | 73462 | 3.4% |
| 3 | 24056 | 1.1% |
| 4 | 8913 | 0.4% |
| 5 | 3413 | 0.2% |
| 6 | 1430 | 0.1% |
| 7 | 599 | < 0.1% |
| 8 | 267 | < 0.1% |
| 9 | 135 | < 0.1% |
| Value | Count | Frequency (%) |
| 43 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 24 | 3 | |
| 23 | 1 | < 0.1% |
| 22 | 3 |
NUMBER OF PERSONS KILLED
Real number (ℝ)
High correlation  Skewed  Zeros 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 31 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0015404272 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 2135856 |
| Zeros (%) | 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.041432376 |
|---|---|
| Coefficient of variation (CV) | 26.896679 |
| Kurtosis | 1852.8104 |
| Mean | 0.0015404272 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 33.180902 |
| Sum | 3295 |
| Variance | 0.0017166418 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2135856 | 99.9% |
| 1 | 3059 | 0.1% |
| 2 | 83 | < 0.1% |
| 3 | 12 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| (Missing) | 31 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2135856 | |
| 1 | 3059 | 0.1% |
| 2 | 83 | < 0.1% |
| 3 | 12 | < 0.1% |
| 4 | 4 | < 0.1% |
| 5 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 5 | 2 | < 0.1% |
| 4 | 4 | < 0.1% |
| 3 | 12 | < 0.1% |
| 2 | 83 | < 0.1% |
| 1 | 3059 | 0.1% |
| 0 | 2135856 |
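Because NUMBER OF PERSONS KILLED takes only seven distinct values, its summary statistics can be rebuilt from the frequency table alone. The sketch below recomputes the non-missing count, sum, mean, and skewness. It uses the population form of skewness (third central moment over the 1.5 power of the variance). The profiler may apply a small-sample correction, but at roughly two million rows the difference is negligible.

```python
from typing import Dict, Tuple

# value -> count, copied from the frequency table above (missing rows excluded).
PERSONS_KILLED: Dict[int, int] = {0: 2_135_856, 1: 3_059, 2: 83, 3: 12, 4: 4, 5: 2, 8: 1}


def summarize(freq: Dict[int, int]) -> Tuple[int, int, float, float]:
    """Return (n, sum, mean, population skewness) for a value -> count table."""
    n = sum(freq.values())
    total = sum(value * count for value, count in freq.items())
    mean = total / n
    m2 = sum(count * (value - mean) ** 2 for value, count in freq.items()) / n
    m3 = sum(count * (value - mean) ** 3 for value, count in freq.items()) / n
    return n, total, mean, m3 / m2**1.5


n, total, mean, skew = summarize(PERSONS_KILLED)
print(n, total, f"{mean:.10f}", f"{skew:.2f}")
```

The count equals the 2139048 observations minus the 31 missing values, and the sum and mean agree with the reported 3295 and 0.0015404272. The large positive skewness follows directly from 99.9% of rows being zero.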
NUMBER OF PEDESTRIANS INJURED
Real number (ℝ)
Zeros 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.057821517 |
| Minimum | 0 |
|---|---|
| Maximum | 27 |
| Zeros | 2020475 |
| Zeros (%) | 94.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 27 |
| Range | 27 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2465722 |
|---|---|
| Coefficient of variation (CV) | 4.2643676 |
| Kurtosis | 121.50453 |
| Mean | 0.057821517 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.5674996 |
| Sum | 123683 |
| Variance | 0.060797851 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2020475 | 94.5% |
| 1 | 114212 | 5.3% |
| 2 | 3865 | 0.2% |
| 3 | 383 | < 0.1% |
| 4 | 62 | < 0.1% |
| 5 | 27 | < 0.1% |
| 6 | 11 | < 0.1% |
| 7 | 5 | < 0.1% |
| 9 | 2 | < 0.1% |
| 8 | 2 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2020475 | |
| 1 | 114212 | 5.3% |
| 2 | 3865 | 0.2% |
| 3 | 383 | < 0.1% |
| 4 | 62 | < 0.1% |
| 5 | 27 | < 0.1% |
| 6 | 11 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 27 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
| 8 | 2 | < 0.1% |
| 7 | 5 | < 0.1% |
| 6 | 11 | < 0.1% |
| 5 | 27 | |
| 4 | 62 |
NUMBER OF PEDESTRIANS KILLED
Categorical
High correlation  Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.3 MiB |
| 0 | |
|---|---|
| 1 | 1590 |
| 2 | 13 |
| 6 | 1 |
| 4 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2137443 | |
| 1 | 1590 | 0.1% |
| 2 | 13 | < 0.1% |
| 6 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2137443 | |
| 1 | 1590 | 0.1% |
| 2 | 13 | < 0.1% |
| 6 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2137443 | |
| 1 | 1590 | 0.1% |
| 2 | 13 | < 0.1% |
| 6 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2139048 |
NUMBER OF CYCLIST INJURED
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.3 MiB |
| 0 | |
|---|---|
| 1 | 58250 |
| 2 | 673 |
| 3 | 24 |
| 4 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2080100 | |
| 1 | 58250 | 2.7% |
| 2 | 673 | < 0.1% |
| 3 | 24 | < 0.1% |
| 4 | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2080100 | |
| 1 | 58250 | 2.7% |
| 2 | 673 | < 0.1% |
| 3 | 24 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2080100 | |
| 1 | 58250 | 2.7% |
| 2 | 673 | < 0.1% |
| 3 | 24 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2139048 |
NUMBER OF CYCLIST KILLED
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.3 MiB |
| 0 | |
|---|---|
| 1 | 256 |
| 2 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2138791 | |
| 1 | 256 | < 0.1% |
| 2 | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2138791 | |
| 1 | 256 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2138791 | |
| 1 | 256 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2139048 |
NUMBER OF MOTORIST INJURED
Real number (ℝ)
High correlation  Zeros 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.22864143 |
| Minimum | 0 |
|---|---|
| Maximum | 43 |
| Zeros | 1819309 |
| Zeros (%) | 85.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.66852401 |
|---|---|
| Coefficient of variation (CV) | 2.923897 |
| Kurtosis | 60.610294 |
| Mean | 0.22864143 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.0249595 |
| Sum | 489075 |
| Variance | 0.44692435 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1819309 | 85.1% |
| 1 | 214875 | 10.0% |
| 2 | 66859 | 3.1% |
| 3 | 23319 | 1.1% |
| 4 | 8727 | 0.4% |
| 5 | 3360 | 0.2% |
| 6 | 1382 | 0.1% |
| 7 | 573 | < 0.1% |
| 8 | 258 | < 0.1% |
| 9 | 130 | < 0.1% |
| Other values (21) | 256 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1819309 | |
| 1 | 214875 | 10.0% |
| 2 | 66859 | 3.1% |
| 3 | 23319 | 1.1% |
| 4 | 8727 | 0.4% |
| 5 | 3360 | 0.2% |
| 6 | 1382 | 0.1% |
| 7 | 573 | < 0.1% |
| 8 | 258 | < 0.1% |
| 9 | 130 | < 0.1% |
| Value | Count | Frequency (%) |
| 43 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 24 | 3 | |
| 23 | 1 | < 0.1% |
| 22 | 2 | |
| 21 | 1 | < 0.1% |
NUMBER OF MOTORIST KILLED
Real number (ℝ)
High correlation  Skewed  Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.00063532936 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 2137793 |
| Zeros (%) | 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.027572005 |
|---|---|
| Coefficient of variation (CV) | 43.397971 |
| Kurtosis | 4006.7511 |
| Mean | 0.00063532936 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 53.526419 |
| Sum | 1359 |
| Variance | 0.00076021545 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2137793 | 99.9% |
| 1 | 1173 | 0.1% |
| 2 | 66 | < 0.1% |
| 3 | 12 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2137793 | 99.9% |
| 1 | 1173 | 0.1% |
| 2 | 66 | < 0.1% |
| 3 | 12 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 5 | 2 | < 0.1% |
| 4 | 2 | < 0.1% |
| 3 | 12 | < 0.1% |
| 2 | 66 | < 0.1% |
| 1 | 1173 | 0.1% |
| 0 | 2137793 | 99.9% |
CONTRIBUTING FACTOR VEHICLE 1
Text
| Distinct | 61 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7247 |
| Missing (%) | 0.3% |
| Memory size | 16.3 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 43 |
| Mean length | 19.558659 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Aggressive Driving/Road Rage |
|---|---|
| 2nd row | Pavement Slippery |
| 3rd row | Following Too Closely |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
|---|---|---|
| unspecified | 722522 | 17.0% |
| driver | 464742 | 10.9% |
| inattention/distraction | 430808 | 10.1% |
| closely | 168527 | 4.0% |
| too | 168527 | 4.0% |
| to | 153071 | 3.6% |
| failure | 133931 | 3.1% |
| yield | 127534 | 3.0% |
| right-of-way | 127534 | 3.0% |
| following | 114773 | 2.7% |
| Other values (96) | 1648193 | 38.7% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| i | 4683319 | 11.2% |
| e | 4243184 | 10.2% |
| n | 3625879 | 8.7% |
| t | 2898877 | 7.0% |
| o | 2465185 | 5.9% |
| r | 2457408 | 5.9% |
| s | 2162592 | 5.2% |
|  | 2128361 | 5.1% |
| a | 2061170 | 4.9% |
| c | 1600117 | 3.8% |
| Other values (45) | 13369077 | 32.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 41695169 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 41695169 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 41695169 |
CONTRIBUTING FACTOR VEHICLE 2
Text
Missing 
| Distinct | 61 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 336447 |
| Missing (%) | 15.7% |
| Memory size | 16.3 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 11 |
| Mean length | 13.05409 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
|---|---|---|
| unspecified | 1517571 | 68.6% |
| driver | 103988 | 4.7% |
| inattention/distraction | 97069 | 4.4% |
| other | 33942 | 1.5% |
| vehicular | 32876 | 1.5% |
| too | 28805 | 1.3% |
| closely | 28805 | 1.3% |
| passing | 22300 | 1.0% |
| to | 22054 | 1.0% |
| lane | 20754 | 0.9% |
| Other values (96) | 304097 | 13.7% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| i | 3708448 | 15.8% |
| e | 3610400 | 15.3% |
| n | 2109289 | 9.0% |
| s | 1807319 | 7.7% |
| c | 1712694 | 7.3% |
| d | 1593491 | 6.8% |
| p | 1589343 | 6.8% |
| f | 1575663 | 6.7% |
| U | 1555327 | 6.6% |
| t | 637147 | 2.7% |
| Other values (45) | 3632194 | 15.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 23531315 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 23531315 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 23531315 |
CONTRIBUTING FACTOR VEHICLE 3
Text
Missing 
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1985155 |
| Missing (%) | 92.8% |
| Memory size | 16.3 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 11 |
| Mean length | 11.658977 |
| Min length | 1 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
|---|---|---|
| unspecified | 143435 | 85.8% |
| other | 2964 | 1.8% |
| vehicular | 2924 | 1.7% |
| driver | 2229 | 1.3% |
| closely | 2093 | 1.3% |
| too | 2093 | 1.3% |
| inattention/distraction | 2040 | 1.2% |
| following | 2036 | 1.2% |
| fatigued/drowsy | 853 | 0.5% |
| pavement | 418 | 0.2% |
| Other values (80) | 6145 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| e | 306610 | 17.1% |
| i | 305180 | 17.0% |
| n | 157369 | 8.8% |
| s | 150660 | 8.4% |
| c | 150109 | 8.4% |
| d | 145596 | 8.1% |
| p | 145157 | 8.1% |
| f | 144378 | 8.0% |
| U | 144141 | 8.0% |
| o | 17939 | 1.0% |
| Other values (45) | 127096 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1794235 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1794235 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1794235 |
CONTRIBUTING FACTOR VEHICLE 4
Categorical
High correlation  Imbalance  Missing 
| Distinct | 42 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2104055 |
| Missing (%) | 98.4% |
| Memory size | 16.3 MiB |
| Unspecified | 33009 |
|---|---|
| Other Vehicular | 654 |
| Following Too Closely | 403 |
| Driver Inattention/Distraction | 289 |
| Fatigued/Drowsy | 170 |
| Other values (37) | 468 |
Length
| Max length | 43 |
|---|---|
| Median length | 11 |
| Mean length | 11.489927 |
| Min length | 5 |
Unique
| Unique | 7 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
Common Values
| Value | Count | Frequency (%) |
| Unspecified | 33009 | 1.5% |
| Other Vehicular | 654 | < 0.1% |
| Following Too Closely | 403 | < 0.1% |
| Driver Inattention/Distraction | 289 | < 0.1% |
| Fatigued/Drowsy | 170 | < 0.1% |
| Pavement Slippery | 120 | < 0.1% |
| Reaction to Uninvolved Vehicle | 43 | < 0.1% |
| Unsafe Speed | 34 | < 0.1% |
| Outside Car Distraction | 31 | < 0.1% |
| Driver Inexperience | 30 | < 0.1% |
| Other values (32) | 210 | < 0.1% |
| (Missing) | 2104055 | 98.4% |
Length
| Value | Count | Frequency (%) |
|---|---|---|
| unspecified | 33009 | 88.1% |
| other | 663 | 1.8% |
| vehicular | 654 | 1.7% |
| too | 408 | 1.1% |
| closely | 408 | 1.1% |
| following | 403 | 1.1% |
| driver | 319 | 0.9% |
| inattention/distraction | 289 | 0.8% |
| fatigued/drowsy | 170 | 0.5% |
| pavement | 123 | 0.3% |
| Other values (65) | 1009 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| e | 69768 | 17.4% |
| i | 69115 | 17.2% |
| n | 35182 | 8.8% |
| c | 34238 | 8.5% |
| s | 34200 | 8.5% |
| p | 33386 | 8.3% |
| d | 33372 | 8.3% |
| f | 33139 | 8.2% |
| U | 33121 | 8.2% |
| o | 3186 | 0.8% |
| Other values (41) | 23360 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 402067 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 402067 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 402067 |
CONTRIBUTING FACTOR VEHICLE 5
Categorical
High correlation  Imbalance  Missing 
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 2129500 |
| Missing (%) | 99.6% |
| Memory size | 16.3 MiB |
| Unspecified | 9001 |
|---|---|
| Other Vehicular | 193 |
| Following Too Closely | 104 |
| Driver Inattention/Distraction | 67 |
| Pavement Slippery | 50 |
| Other values (26) | 133 |
Length
| Max length | 43 |
|---|---|
| Median length | 11 |
| Mean length | 11.467114 |
| Min length | 5 |
Unique
| Unique | 11 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
Common Values
| Value | Count | Frequency (%) |
| Unspecified | 9001 | 0.4% |
| Other Vehicular | 193 | < 0.1% |
| Following Too Closely | 104 | < 0.1% |
| Driver Inattention/Distraction | 67 | < 0.1% |
| Pavement Slippery | 50 | < 0.1% |
| Fatigued/Drowsy | 41 | < 0.1% |
| Reaction to Uninvolved Vehicle | 12 | < 0.1% |
| Alcohol Involvement | 11 | < 0.1% |
| Obstruction/Debris | 10 | < 0.1% |
| Driver Inexperience | 10 | < 0.1% |
| Other values (21) | 49 | < 0.1% |
| (Missing) | 2129500 | 99.6% |
Length
| Value | Count | Frequency (%) |
|---|---|---|
| unspecified | 9001 | 88.2% |
| other | 195 | 1.9% |
| vehicular | 193 | 1.9% |
| too | 106 | 1.0% |
| closely | 106 | 1.0% |
| following | 104 | 1.0% |
| driver | 77 | 0.8% |
| inattention/distraction | 67 | 0.7% |
| pavement | 51 | 0.5% |
| slippery | 50 | 0.5% |
| Other values (48) | 256 | 2.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| e | 19065 | 17.4% |
| i | 18811 | 17.2% |
| n | 9549 | 8.7% |
| c | 9340 | 8.5% |
| s | 9285 | 8.5% |
| p | 9129 | 8.3% |
| d | 9087 | 8.3% |
| f | 9028 | 8.2% |
| U | 9024 | 8.2% |
| o | 818 | 0.7% |
| Other values (40) | 6352 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 109488 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 109488 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 109488 |
COLLISION_ID
Real number (ℝ)
Unique 
| Distinct | 2139048 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3206744.8 |
| Minimum | 22 |
|---|---|
| Maximum | 4775840 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.3 MiB |
Quantile statistics
| Minimum | 22 |
|---|---|
| 5-th percentile | 107815.35 |
| Q1 | 3170881.8 |
| median | 3705786.5 |
| Q3 | 4240781.2 |
| 95-th percentile | 4668661.7 |
| Maximum | 4775840 |
| Range | 4775818 |
| Interquartile range (IQR) | 1069899.5 |
Descriptive statistics
| Standard deviation | 1506827.2 |
|---|---|
| Coefficient of variation (CV) | 0.46989307 |
| Kurtosis | 0.052063161 |
| Mean | 3206744.8 |
| Median Absolute Deviation (MAD) | 534950 |
| Skewness | -1.2425404 |
| Sum | 6.859381 × 10¹² |
| Variance | 2.2705281 × 10¹² |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4775537 | 1 | < 0.1% |
| 4455765 | 1 | < 0.1% |
| 4513547 | 1 | < 0.1% |
| 4541903 | 1 | < 0.1% |
| 4456314 | 1 | < 0.1% |
| 4486609 | 1 | < 0.1% |
| 4407458 | 1 | < 0.1% |
| 4486555 | 1 | < 0.1% |
| 4775649 | 1 | < 0.1% |
| 4775076 | 1 | < 0.1% |
| Other values (2139038) | 2139038 | > 99.9% |
| Value | Count | Frequency (%) |
|---|---|---|
| 22 | 1 | < 0.1% |
| 23 | 1 | < 0.1% |
| 24 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 26 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
|---|---|---|
| 4775840 | 1 | < 0.1% |
| 4775835 | 1 | < 0.1% |
| 4775832 | 1 | < 0.1% |
| 4775820 | 1 | < 0.1% |
| 4775817 | 1 | < 0.1% |
| 4775815 | 1 | < 0.1% |
| 4775810 | 1 | < 0.1% |
| 4775809 | 1 | < 0.1% |
| 4775807 | 1 | < 0.1% |
| 4775801 | 1 | < 0.1% |
VEHICLE TYPE CODE 1
Text
| Distinct | 1740 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 14731 |
| Missing (%) | 0.7% |
| Memory size | 16.3 MiB |
Length
| Max length | 38 |
|---|---|
| Median length | 35 |
| Mean length | 16.858735 |
| Min length | 1 |
Unique
| Unique | 1057 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Sedan |
|---|---|
| 2nd row | Sedan |
| 3rd row | Sedan |
| 4th row | Sedan |
| 5th row | Dump |
| Value | Count | Frequency (%) |
|---|---|---|
| vehicle | 902084 | 18.0% |
| utility | 655616 | 13.1% |
| station | 655572 | 13.1% |
| sedan | 647727 | 12.9% |
| wagon/sport | 475280 | 9.5% |
| passenger | 416223 | 8.3% |
|  | 181733 | 3.6% |
| wagon | 180357 | 3.6% |
| sport | 180291 | 3.6% |
| truck | 89093 | 1.8% |
| Other values (1003) | 630045 | 12.6% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
|  | 2902924 | 8.1% |
| S | 2807922 | 7.8% |
| t | 2412208 | 6.7% |
| i | 2031525 | 5.7% |
| E | 1820010 | 5.1% |
| a | 1696201 | 4.7% |
| e | 1689518 | 4.7% |
| n | 1621382 | 4.5% |
| o | 1507025 | 4.2% |
| T | 1147439 | 3.2% |
| Other values (67) | 16177144 | 45.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 35813298 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 35813298 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 35813298 |
VEHICLE TYPE CODE 2
Text
Missing 
| Distinct | 1927 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 417714 |
| Missing (%) | 19.5% |
| Memory size | 16.3 MiB |
Length
| Max length | 38 |
|---|---|
| Median length | 30 |
| Mean length | 16.048236 |
| Min length | 1 |
Unique
| Unique | 1144 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Sedan |
|---|---|
| 2nd row | Pick-up Truck |
| 3rd row | Sedan |
| 4th row | Tractor Truck Diesel |
| 5th row | Sedan |
| Value | Count | Frequency (%) |
|---|---|---|
| vehicle | 666333 | 17.0% |
| utility | 479362 | 12.2% |
| station | 479331 | 12.2% |
| sedan | 452060 | 11.5% |
| wagon/sport | 339127 | 8.7% |
| passenger | 318613 | 8.1% |
|  | 141595 | 3.6% |
| wagon | 140261 | 3.6% |
| sport | 140204 | 3.6% |
| truck | 88557 | 2.3% |
| Other values (1051) | 670787 | 17.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
|  | 2207864 | 8.0% |
| S | 2073695 | 7.5% |
| t | 1731570 | 6.3% |
| i | 1488595 | 5.4% |
| E | 1440409 | 5.2% |
| e | 1240318 | 4.5% |
| a | 1210326 | 4.4% |
| n | 1150132 | 4.2% |
| o | 1104650 | 4.0% |
| T | 924311 | 3.3% |
| Other values (63) | 13052505 | 47.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 27624375 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 27624375 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 27624375 |
VEHICLE TYPE CODE 3
Text
Missing 
| Distinct | 276 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1990930 |
| Missing (%) | 93.1% |
| Memory size | 16.3 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 30 |
| Mean length | 17.666644 |
| Min length | 2 |
Unique
| Unique | 163 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Sedan |
|---|---|
| 2nd row | Station Wagon/Sport Utility Vehicle |
| 3rd row | Sedan |
| 4th row | Sedan |
| 5th row | Sedan |
| Value | Count | Frequency (%) |
|---|---|---|
| vehicle | 66321 | 18.5% |
| utility | 51533 | 14.4% |
| station | 51530 | 14.4% |
| sedan | 49693 | 13.8% |
| wagon/sport | 38171 | 10.6% |
| passenger | 27716 | 7.7% |
|  | 13443 | 3.7% |
| wagon | 13359 | 3.7% |
| sport | 13358 | 3.7% |
| truck | 4583 | 1.3% |
| Other values (225) | 29153 | 8.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
|  | 211177 | 8.1% |
| S | 207278 | 7.9% |
| t | 192358 | 7.4% |
| i | 158893 | 6.1% |
| a | 129793 | 5.0% |
| e | 129373 | 4.9% |
| n | 126987 | 4.9% |
| o | 117714 | 4.5% |
| E | 116431 | 4.4% |
| l | 77821 | 3.0% |
| Other values (52) | 1148923 | 43.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2616748 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2616748 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2616748 |
VEHICLE TYPE CODE 4
Text
Missing 
| Distinct | 108 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 2105307 |
| Missing (%) | 98.4% |
| Memory size | 16.3 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 30 |
| Mean length | 18.007587 |
| Min length | 2 |
Unique
| Unique | 51 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Station Wagon/Sport Utility Vehicle |
|---|---|
| 2nd row | Sedan |
| 3rd row | Station Wagon/Sport Utility Vehicle |
| 4th row | Sedan |
| 5th row | Sedan |
| Value | Count | Frequency (%) |
|---|---|---|
| vehicle | 15533 | 18.9% |
| utility | 12359 | 15.0% |
| station | 12359 | 15.0% |
| sedan | 12083 | 14.7% |
| wagon/sport | 9507 | 11.5% |
| passenger | 5970 | 7.2% |
|  | 2860 | 3.5% |
| sport | 2852 | 3.5% |
| wagon | 2852 | 3.5% |
| truck | 844 | 1.0% |
| Other values (107) | 5159 | 6.3% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
|  | 48693 | 8.0% |
| S | 48374 | 8.0% |
| t | 47762 | 7.9% |
| i | 39190 | 6.5% |
| a | 31788 | 5.2% |
| e | 31578 | 5.2% |
| n | 31253 | 5.1% |
| o | 29021 | 4.8% |
| E | 24673 | 4.1% |
| l | 19262 | 3.2% |
| Other values (48) | 256000 | 42.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 607594 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 607594 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 607594 |
VEHICLE TYPE CODE 5
Text
Missing 
| Distinct | 73 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 2129794 |
| Missing (%) | 99.6% |
| Memory size | 16.3 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 30 |
| Mean length | 18.154636 |
| Min length | 2 |
Unique
| Unique | 33 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Station Wagon/Sport Utility Vehicle |
|---|---|
| 2nd row | Station Wagon/Sport Utility Vehicle |
| 3rd row | Sedan |
| 4th row | Sedan |
| 5th row | Station Wagon/Sport Utility Vehicle |
| Value | Count | Frequency (%) |
|---|---|---|
| vehicle | 4200 | 18.5% |
| station | 3506 | 15.4% |
| utility | 3506 | 15.4% |
| sedan | 3427 | 15.1% |
| wagon/sport | 2704 | 11.9% |
| passenger | 1487 | 6.5% |
|  | 804 | 3.5% |
| wagon | 804 | 3.5% |
| sport | 802 | 3.5% |
| truck | 261 | 1.1% |
| Other values (72) | 1236 | 5.4% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| t | 13597 | 8.1% |
|  | 13493 | 8.0% |
| S | 13331 | 7.9% |
| i | 11152 | 6.6% |
| a | 9025 | 5.4% |
| e | 8974 | 5.3% |
| n | 8898 | 5.3% |
| o | 8279 | 4.9% |
| E | 6130 | 3.6% |
| l | 5482 | 3.3% |
| Other values (45) | 69642 | 41.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 168003 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 168003 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 168003 |
Interactions
Correlations
| | BOROUGH | COLLISION_ID | CONTRIBUTING FACTOR VEHICLE 4 | CONTRIBUTING FACTOR VEHICLE 5 | LATITUDE | LONGITUDE | NUMBER OF CYCLIST INJURED | NUMBER OF CYCLIST KILLED | NUMBER OF MOTORIST INJURED | NUMBER OF MOTORIST KILLED | NUMBER OF PEDESTRIANS INJURED | NUMBER OF PEDESTRIANS KILLED | NUMBER OF PERSONS INJURED | NUMBER OF PERSONS KILLED |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BOROUGH | 1.000 | 0.055 | 0.052 | 0.047 | 0.006 | 0.006 | 0.028 | 0.001 | 0.008 | 0.004 | 0.002 | 0.000 | 0.008 | 0.002 |
| COLLISION_ID | 0.055 | 1.000 | 0.064 | 0.076 | -0.014 | 0.068 | 0.040 | 0.004 | 0.114 | 0.009 | 0.033 | 0.004 | 0.147 | 0.011 |
| CONTRIBUTING FACTOR VEHICLE 4 | 0.052 | 0.064 | 1.000 | 0.694 | 0.000 | 0.000 | 0.000 | 0.000 | 0.024 | 0.000 | 0.143 | 0.000 | 0.026 | 0.000 |
| CONTRIBUTING FACTOR VEHICLE 5 | 0.047 | 0.076 | 0.694 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.040 | 0.000 | 0.000 | 0.000 | 0.038 | 0.000 |
| LATITUDE | 0.006 | -0.014 | 0.000 | 0.000 | 1.000 | 0.285 | 0.002 | 0.000 | -0.032 | -0.001 | 0.003 | 0.000 | -0.026 | -0.001 |
| LONGITUDE | 0.006 | 0.068 | 0.000 | 0.000 | 0.285 | 1.000 | 0.002 | 0.000 | 0.075 | 0.006 | -0.014 | 0.000 | 0.039 | 0.003 |
| NUMBER OF CYCLIST INJURED | 0.028 | 0.040 | 0.000 | 0.000 | 0.002 | 0.002 | 1.000 | 0.018 | 0.004 | 0.001 | 0.000 | 0.002 | 0.004 | 0.005 |
| NUMBER OF CYCLIST KILLED | 0.001 | 0.004 | 0.000 | 0.000 | 0.000 | 0.000 | 0.018 | 1.000 | 0.000 | 0.000 | 0.167 | 0.707 | 0.040 | 0.736 |
| NUMBER OF MOTORIST INJURED | 0.008 | 0.114 | 0.024 | 0.040 | -0.032 | 0.075 | 0.004 | 0.000 | 1.000 | 0.018 | -0.090 | 0.008 | 0.782 | 0.008 |
| NUMBER OF MOTORIST KILLED | 0.004 | 0.009 | 0.000 | 0.000 | -0.001 | 0.006 | 0.001 | 0.000 | 0.018 | 1.000 | -0.004 | 0.017 | 0.012 | 0.627 |
| NUMBER OF PEDESTRIANS INJURED | 0.002 | 0.033 | 0.143 | 0.000 | 0.003 | -0.014 | 0.000 | 0.167 | -0.090 | -0.004 | 1.000 | 0.169 | 0.412 | -0.002 |
| NUMBER OF PEDESTRIANS KILLED | 0.000 | 0.004 | 0.000 | 0.000 | 0.000 | 0.000 | 0.002 | 0.707 | 0.008 | 0.017 | 0.169 | 1.000 | 0.036 | 0.693 |
| NUMBER OF PERSONS INJURED | 0.008 | 0.147 | 0.026 | 0.038 | -0.026 | 0.039 | 0.004 | 0.040 | 0.782 | 0.012 | 0.412 | 0.036 | 1.000 | 0.003 |
| NUMBER OF PERSONS KILLED | 0.002 | 0.011 | 0.000 | 0.000 | -0.001 | 0.003 | 0.005 | 0.736 | 0.008 | 0.627 | -0.002 | 0.693 | 0.003 | 1.000 |
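The high-correlation alerts in the report overview can be traced back to this matrix by scanning it for large coefficients. The sketch below copies a handful of off-diagonal values from the table and lists the pairs at or above a cut-off. The 0.5 cut-off and the helper name `strong_pairs` are assumptions made for illustration; the report does not state the threshold it applies.

```python
# Selected off-diagonal coefficients copied from the matrix above.
pairs = {
    ("CONTRIBUTING FACTOR VEHICLE 4", "CONTRIBUTING FACTOR VEHICLE 5"): 0.694,
    ("NUMBER OF CYCLIST KILLED", "NUMBER OF PEDESTRIANS KILLED"): 0.707,
    ("NUMBER OF CYCLIST KILLED", "NUMBER OF PERSONS KILLED"): 0.736,
    ("NUMBER OF MOTORIST INJURED", "NUMBER OF PERSONS INJURED"): 0.782,
    ("NUMBER OF MOTORIST KILLED", "NUMBER OF PERSONS KILLED"): 0.627,
    ("NUMBER OF PEDESTRIANS KILLED", "NUMBER OF PERSONS KILLED"): 0.693,
    ("NUMBER OF PEDESTRIANS INJURED", "NUMBER OF PERSONS INJURED"): 0.412,
    ("LATITUDE", "LONGITUDE"): 0.285,
}

THRESHOLD = 0.5  # assumed cut-off, for illustration only


def strong_pairs(coeffs, threshold=THRESHOLD):
    """Return (pair, value) tuples with |value| >= threshold, strongest first."""
    hits = [(pair, r) for pair, r in coeffs.items() if abs(r) >= threshold]
    return sorted(hits, key=lambda item: abs(item[1]), reverse=True)


for (left, right), r in strong_pairs(pairs):
    print(f"{r:.3f}  {left}  <->  {right}")
```

With these inputs, the six pairs that clear the cut-off are the same pairs named in the high-correlation alerts, while the 0.412 and 0.285 entries fall below it.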
Missing values
Sample
| | CRASH DATE | CRASH TIME | BOROUGH | ZIP CODE | LATITUDE | LONGITUDE | LOCATION | ON STREET NAME | CROSS STREET NAME | OFF STREET NAME | NUMBER OF PERSONS INJURED | NUMBER OF PERSONS KILLED | NUMBER OF PEDESTRIANS INJURED | NUMBER OF PEDESTRIANS KILLED | NUMBER OF CYCLIST INJURED | NUMBER OF CYCLIST KILLED | NUMBER OF MOTORIST INJURED | NUMBER OF MOTORIST KILLED | CONTRIBUTING FACTOR VEHICLE 1 | CONTRIBUTING FACTOR VEHICLE 2 | CONTRIBUTING FACTOR VEHICLE 3 | CONTRIBUTING FACTOR VEHICLE 4 | CONTRIBUTING FACTOR VEHICLE 5 | COLLISION_ID | VEHICLE TYPE CODE 1 | VEHICLE TYPE CODE 2 | VEHICLE TYPE CODE 3 | VEHICLE TYPE CODE 4 | VEHICLE TYPE CODE 5 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 09/11/2021 | 2:39 | NaN | NaN | NaN | NaN | NaN | WHITESTONE EXPRESSWAY | 20 AVENUE | NaN | 2.0 | 0.0 | 0 | 0 | 0 | 0 | 2 | 0 | Aggressive Driving/Road Rage | Unspecified | NaN | NaN | NaN | 4455765 | Sedan | Sedan | NaN | NaN | NaN |
| 1 | 03/26/2022 | 11:45 | NaN | NaN | NaN | NaN | NaN | QUEENSBORO BRIDGE UPPER | NaN | NaN | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Pavement Slippery | NaN | NaN | NaN | NaN | 4513547 | Sedan | NaN | NaN | NaN | NaN |
| 2 | 06/29/2022 | 6:55 | NaN | NaN | NaN | NaN | NaN | THROGS NECK BRIDGE | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Following Too Closely | Unspecified | NaN | NaN | NaN | 4541903 | Sedan | Pick-up Truck | NaN | NaN | NaN |
| 3 | 09/11/2021 | 9:35 | BROOKLYN | 11208.0 | 40.667202 | -73.866500 | (40.667202, -73.8665) | NaN | NaN | 1211 LORING AVENUE | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Unspecified | NaN | NaN | NaN | NaN | 4456314 | Sedan | NaN | NaN | NaN | NaN |
| 4 | 12/14/2021 | 8:13 | BROOKLYN | 11233.0 | 40.683304 | -73.917274 | (40.683304, -73.917274) | SARATOGA AVENUE | DECATUR STREET | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | 4486609 | NaN | NaN | NaN | NaN | NaN |
| 5 | 04/14/2021 | 12:47 | NaN | NaN | NaN | NaN | NaN | MAJOR DEEGAN EXPRESSWAY RAMP | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Unspecified | Unspecified | NaN | NaN | NaN | 4407458 | Dump | Sedan | NaN | NaN | NaN |
| 6 | 12/14/2021 | 17:05 | NaN | NaN | 40.709183 | -73.956825 | (40.709183, -73.956825) | BROOKLYN QUEENS EXPRESSWAY | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Passing Too Closely | Unspecified | NaN | NaN | NaN | 4486555 | Sedan | Tractor Truck Diesel | NaN | NaN | NaN |
| 7 | 12/14/2021 | 8:17 | BRONX | 10475.0 | 40.868160 | -73.831480 | (40.86816, -73.83148) | NaN | NaN | 344 BAYCHESTER AVENUE | 2.0 | 0.0 | 0 | 0 | 0 | 0 | 2 | 0 | Unspecified | Unspecified | NaN | NaN | NaN | 4486660 | Sedan | Sedan | NaN | NaN | NaN |
| 8 | 12/14/2021 | 21:10 | BROOKLYN | 11207.0 | 40.671720 | -73.897100 | (40.67172, -73.8971) | NaN | NaN | 2047 PITKIN AVENUE | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Driver Inexperience | Unspecified | NaN | NaN | NaN | 4487074 | Sedan | NaN | NaN | NaN | NaN |
| 9 | 12/14/2021 | 14:58 | MANHATTAN | 10017.0 | 40.751440 | -73.973970 | (40.75144, -73.97397) | 3 AVENUE | EAST 43 STREET | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Passing Too Closely | Unspecified | NaN | NaN | NaN | 4486519 | Sedan | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN |
| | CRASH DATE | CRASH TIME | BOROUGH | ZIP CODE | LATITUDE | LONGITUDE | LOCATION | ON STREET NAME | CROSS STREET NAME | OFF STREET NAME | NUMBER OF PERSONS INJURED | NUMBER OF PERSONS KILLED | NUMBER OF PEDESTRIANS INJURED | NUMBER OF PEDESTRIANS KILLED | NUMBER OF CYCLIST INJURED | NUMBER OF CYCLIST KILLED | NUMBER OF MOTORIST INJURED | NUMBER OF MOTORIST KILLED | CONTRIBUTING FACTOR VEHICLE 1 | CONTRIBUTING FACTOR VEHICLE 2 | CONTRIBUTING FACTOR VEHICLE 3 | CONTRIBUTING FACTOR VEHICLE 4 | CONTRIBUTING FACTOR VEHICLE 5 | COLLISION_ID | VEHICLE TYPE CODE 1 | VEHICLE TYPE CODE 2 | VEHICLE TYPE CODE 3 | VEHICLE TYPE CODE 4 | VEHICLE TYPE CODE 5 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2139038 | 11/30/2024 | 20:40 | BROOKLYN | 11211.0 | 40.713100 | -73.957466 | (40.7131, -73.957466) | NaN | NaN | 291 GRAND ST | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Failure to Yield Right-of-Way | Unspecified | NaN | NaN | NaN | 4775771 | Sedan | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN |
| 2139039 | 11/30/2024 | 23:00 | BROOKLYN | 11231.0 | 40.675102 | -74.001686 | (40.675102, -74.001686) | CLINTON ST | MILL ST | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Passing Too Closely | Unspecified | NaN | NaN | NaN | 4775359 | Sedan | NaN | NaN | NaN | NaN |
| 2139040 | 11/30/2024 | 9:58 | BROOKLYN | 11208.0 | 40.678535 | -73.875230 | (40.678535, -73.87523) | NaN | NaN | 1 WELLS ST | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Other Vehicular | Other Vehicular | Brakes Defective | NaN | NaN | 4775336 | Station Wagon/Sport Utility Vehicle | Sedan | Pick-up Truck | NaN | NaN |
| 2139041 | 11/26/2024 | 20:49 | MANHATTAN | 10031.0 | 40.823940 | -73.948555 | (40.82394, -73.948555) | W 143 ST | AMSTERDAM AVE | NaN | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Driver Inattention/Distraction | Unspecified | NaN | NaN | NaN | 4775721 | Taxi | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN |
| 2139042 | 11/30/2024 | 15:25 | NaN | NaN | 40.704494 | -73.817430 | (40.704494, -73.81743) | VAN WYCK EXPWY | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Unspecified | Unspecified | NaN | NaN | NaN | 4775503 | Station Wagon/Sport Utility Vehicle | Sedan | NaN | NaN | NaN |
| 2139043 | 11/30/2024 | 21:30 | QUEENS | 11373.0 | 40.742190 | -73.869545 | (40.74219, -73.869545) | NaN | NaN | 94-18 CORONA AVE | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Driver Inattention/Distraction | Unspecified | NaN | NaN | NaN | 4775566 | Station Wagon/Sport Utility Vehicle | Bike | NaN | NaN | NaN |
| 2139044 | 11/26/2024 | 12:55 | MANHATTAN | 10025.0 | 40.803566 | -73.967140 | (40.803566, -73.96714) | W 109 ST | BROADWAY | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Failure to Yield Right-of-Way | Unspecified | NaN | NaN | NaN | 4775621 | Bike | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN |
| 2139045 | 11/30/2024 | 0:36 | NaN | NaN | 40.666435 | -73.834780 | (40.666435, -73.83478) | BELT PARKWAY | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Driver Inattention/Distraction | Following Too Closely | NaN | NaN | NaN | 4775484 | Sedan | Sedan | NaN | NaN | NaN |
| 2139046 | 11/29/2024 | 12:14 | QUEENS | 11373.0 | 40.741160 | -73.882706 | (40.74116, -73.882706) | NaN | NaN | 45-11 82 ST | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Driver Inattention/Distraction | Unspecified | NaN | NaN | NaN | 4775820 | Sedan | Box Truck | NaN | NaN | NaN |
| 2139047 | 11/30/2024 | 4:42 | QUEENS | 11435.0 | 40.698463 | -73.808205 | (40.698463, -73.808205) | NaN | NaN | 144-06 94 AVE | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Driver Inattention/Distraction | Unspecified | NaN | NaN | NaN | 4775537 | Sedan | Sedan | NaN | NaN | NaN |